Optimizing server placement for parallel I/O in switch-based clusters
Abstract
In this paper, we consider how to optimize I/O server placement to improve parallel I/O performance in switch-based clusters. Significant advances in cluster networks in recent years have made it practical to connect tens of thousands of hosts via networks that have enormous and scalable total capacity, and in which communication between a host and any other host incurs the same cost. This same-cost property frees users from considering network contention and allows them to concentrate on load-balancing issues. We formulate the server placement problem on a cluster that has the same-cost property as a weighted bipartite matching problem, with the goal of balancing the workload on the I/O nodes. To find an optimal solution to this problem, we propose an $O(n^{3/2} m(\log n + \log m))$ algorithm, called Load Balance Matching (LBM), where $n$ is the number of compute nodes and $m$ is the number of I/O servers. We also investigate server placement for general clusters, in which multiple same-cost subclusters are interconnected to form a large cluster. This class of clusters typically adopts irregular topologies that allow the construction of scalable systems with an incremental expansion capability. Because bandwidth on the network links between subclusters is limited, link contention is a major concern when distributing servers over the entire network. We show that finding an optimal placement strategy for general clusters with the goal of minimizing link contention is computationally intractable. To resolve this problem, we propose a hierarchical strategy that places servers in two steps. First, to minimize link contention, we decide which ...
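The load-balancing idea behind a matching formulation can be illustrated with a small sketch. This is not the paper's LBM algorithm, whose details are not given in the abstract. It is a simpler stand-in that assumes every compute node contributes one unit of load and may only use servers from a hypothetical `eligible` list; it binary-searches the smallest per-server cap for which an augmenting-path bipartite matching can place every node. The names `can_assign`, `balance_load`, and `eligible` are illustrative assumptions.

```python
from typing import List, Optional, Tuple


def can_assign(eligible: List[List[int]], num_servers: int,
               cap: int) -> Optional[List[int]]:
    """Try to give every node an eligible server with at most `cap` nodes each.

    Returns owner[node] = server on success, or None if no such assignment
    exists. Assumption: unit load per node (not the paper's weighted model).
    """
    assigned: List[List[int]] = [[] for _ in range(num_servers)]
    owner = [-1] * len(eligible)

    def try_place(node: int, seen: List[bool]) -> bool:
        for s in eligible[node]:
            if seen[s]:
                continue
            seen[s] = True
            if len(assigned[s]) < cap:
                assigned[s].append(node)
                owner[node] = s
                return True
            # Server is full: try to move one of its nodes elsewhere
            # (an augmenting path), then take the freed slot.
            for other in list(assigned[s]):
                if try_place(other, seen):
                    assigned[s].remove(other)
                    assigned[s].append(node)
                    owner[node] = s
                    return True
        return False

    for node in range(len(eligible)):
        if not try_place(node, [False] * num_servers):
            return None
    return owner


def balance_load(eligible: List[List[int]],
                 num_servers: int) -> Tuple[int, List[int]]:
    """Find the smallest maximum server load and one assignment achieving it."""
    if not eligible:
        return 0, []
    lo, hi = 1, len(eligible)
    best: Optional[Tuple[int, List[int]]] = None
    while lo <= hi:  # feasibility is monotone in the cap
        mid = (lo + hi) // 2
        owner = can_assign(eligible, num_servers, mid)
        if owner is not None:
            best = (mid, owner)
            hi = mid - 1
        else:
            lo = mid + 1
    if best is None:
        raise ValueError("some compute node has no eligible server")
    return best


if __name__ == "__main__":
    # Six compute nodes, three I/O servers (hypothetical eligibility lists).
    demo = [[0], [0, 1], [0, 1], [1, 2], [2], [2]]
    max_load, owner = balance_load(demo, 3)
    print(max_load, owner)
```

In a cluster with the same-cost property every server would be reachable by every node at equal cost, so the eligibility lists here serve only to make the example non-trivial. Extending the sketch to weighted loads would require a different feasibility check.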
Similar resources
Reliability and Supply Security based Method for Simultaneous Placement of Sectionalizer Switch and DER Units
This paper presents a new and useful methodology for simultaneous allocation of sectionalizer switches and distributed energy resources (DERs) considering both reliability and supply security aspects. The proposed algorithm defines the proper locations of sectionalizer switching devices in radial distribution networks considering the effect of DER units in the presented cost function and other ...
Server I/O Acceleration Using an Embedded Multi-core Architecture
This paper presents a feasibility study on the use of an embedded multi-core system-on-a-chip (SoC) architecture to accelerate server I/O subsystem functions, as an alternative to implementation via finite state machines (FSMs) and hardwired logic. The multi-core solution is significantly more programmable than FSMs and avoids many of their shortcomings. For the purposes of this SoC we use the ...
Performance Evaluation of I/O Traffic and Placement of I/O Nodes on a High Performance Network
A common trend in the design of large-scale clusters is to use a high-performance data network to integrate the processing nodes in a single parallel computer. In these systems the performance of the interconnect can be a limiting factor for the input/output (I/O), which is traditionally bottlenecked by the disk bandwidth. In this paper we present an experimental analysis on a 64-node AlphaServ...
Efficient Parallel I/O on SCI Connected Clusters
efficient use of the underlying SCI interconnect is presented. Every process of the MPI application is client as well as server for I/O operations. This allows for a maximum of locality in file access, while the accesses to remote portions of the distributed file are performed via distributed shared memory techniques. A server is only required to manage the initial distribution of the file frag...
A Framework for Integrated Communication and I/O Placement
This paper describes a framework for analyzing dataflow within an out-of-core parallel program. Dataflow properties of the FORALL statement are analyzed and a unified I/O and communication placement framework is presented. This placement framework can be applied to many problems, which include eliminating redundant I/O incurred in communication. The framework is validated by applying it for optimizing ...
Journal:
- J. Parallel Distrib. Comput.
Volume 69
Publication date: 2009